ProbCons: Probabilistic consistency-based multiple sequence alignment.
Abstract
To study gene evolution across a wide range of organisms, biologists need accurate tools for multiple sequence alignment of protein families. Obtaining accurate alignments, however, is a difficult computational problem because of not only the high computational cost but also the lack of proper objective functions for measuring alignment quality. In this paper, we introduce probabilistic consistency, a novel scoring function for multiple sequence comparisons. We present ProbCons, a practical tool for progressive protein multiple sequence alignment based on probabilistic consistency, and evaluate its performance on several standard alignment benchmark data sets. On the BAliBASE, SABmark, and PREFAB benchmark alignment databases, ProbCons achieves statistically significant improvement over other leading methods while maintaining practical speed. ProbCons is publicly available as a Web resource.
Similar resources
PROBCONS: Probabilistic Consistency-Based Multiple Alignment of Amino Acid Sequences
Obtaining an accurate multiple alignment of protein sequences is a difficult computational problem for which many heuristic techniques sacrifice optimality to achieve reasonable running times. The most commonly used heuristic is progressive alignment, which merges sequences into a multiple alignment by pairwise comparisons along the nodes of a guide tree. To improve accuracy, consistency-based ...
MSAProbs: multiple sequence alignment based on pair hidden Markov models and partition function posterior probabilities
MOTIVATION: Multiple sequence alignment is of central importance to bioinformatics and computational biology. Although a large number of algorithms for computing a multiple sequence alignment have been designed, the efficient computation of highly accurate multiple alignments is still a challenge. RESULTS: We present MSAProbs, a new and practical multiple alignment algorithm for protein sequenc...
SPEM: improving multiple sequence alignment with sequence profiles and predicted secondary structures
MOTIVATION: Multiple sequence alignment is an essential part of bioinformatics tools for a genome-scale study of genes and their evolution relations. However, making an accurate alignment between remote homologs is challenging. Here, we develop a method, called SPEM, that aligns multiple sequences using pre-processed sequence profiles and predicted secondary structures for pairwise alignment, co...
Multiple Sequence Alignment Tools: Assessing Performance of the Underlying Algorithms
Multiple sequence alignments have a primary role in several domains of modern molecular biology, such as protein 3D structure/function prediction, phylogeny inference, molecular function, intermolecular interactions, and many other common tasks in sequence analysis. Presently, many tools to construct multiple sequence alignments are available, but none of them is accurate for all types of data sets....
PROMALS: towards accurate multiple sequence alignments of distantly related proteins
MOTIVATION: Accurate multiple sequence alignments are essential in protein structure modeling, functional prediction and efficient planning of experiments. Although the alignment problem has attracted considerable attention, preparation of high-quality alignments for distantly related sequences remains a difficult task. RESULTS: We developed PROMALS, a multiple alignment method that shows promi...
Journal title:
- Genome Research
Volume 15, Issue 2
Pages: -
Publication date: 2005